Information Capacity

Definition

C=maxpXI(X;Y) \begin{aligned} C &= \max_{p_X} I(X; Y) \end{aligned}

An Interpretation

Each input n-sequence XnX^{n} corresponds to approximately 2nH(YX)2^{nH(Y|X)} possible and equally likely output n-sequences YnY^{n}​.

The total number of possible YnY^{n} is approximately 2nH(Y)2^{nH(Y)}, which has to be divided into sets of size 2nH(YX)2^{nH(Y|X)} for different XnX^{n} sequences.

The total number of disjoint sets is no more than 2n[H(Y)H(YX)]=2nI(X;Y)2^{n[H(Y) - H(Y|X)]} = 2^{n I(X; Y)}. There are at most 2nI(X;Y) \approx 2^{nI(X; Y)} distinguishable sequences of length nn.

by Jon